Loss function

Results: 478



#Item
21Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation Hamid R. Maei University of Alberta Edmonton, AB, Canada

Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation Hamid R. Maei University of Alberta Edmonton, AB, Canada

Add to Reading List

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2010-03-02 00:09:06
22Trust Region Policy Optimization  arXiv:1502.05477v4 [cs.LG] 6 Jun 2016 John Schulman JOSCHU @ EECS . BERKELEY. EDU

Trust Region Policy Optimization arXiv:1502.05477v4 [cs.LG] 6 Jun 2016 John Schulman JOSCHU @ EECS . BERKELEY. EDU

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2016-06-06 20:48:19
23JMLR: Workshop and Conference Proceedings vol 40:1–38, 2015  Thompson Sampling for Learning Parameterized Markov Decision Processes Aditya Gopalan

JMLR: Workshop and Conference Proceedings vol 40:1–38, 2015 Thompson Sampling for Learning Parameterized Markov Decision Processes Aditya Gopalan

Add to Reading List

Source URL: jmlr.org

Language: English - Date: 2015-07-20 20:08:36
24JIANG et al.: PARAMETRIC TEMPORAL ALIGNMENT FOR FACIAL ACTION TEMPORAL SEGMENTS DETECTION  1 Parametric temporal alignment for the detection of facial action temporal segments

JIANG et al.: PARAMETRIC TEMPORAL ALIGNMENT FOR FACIAL ACTION TEMPORAL SEGMENTS DETECTION 1 Parametric temporal alignment for the detection of facial action temporal segments

Add to Reading List

Source URL: ibug.doc.ic.ac.uk

Language: English - Date: 2015-11-12 06:27:11
25Optimal Threshold Control for Energy Arbitrage with Degradable Battery Storage Marek Petrik IBM T. J. Watson Research Center Yorktown, NY 10598

Optimal Threshold Control for Energy Arbitrage with Degradable Battery Storage Marek Petrik IBM T. J. Watson Research Center Yorktown, NY 10598

Add to Reading List

Source URL: marek.petrik.us

Language: English - Date: 2016-07-14 09:59:52
26A Framework for the Analysis of Self-Con…rming Policies P. Battigalli,a S. Cerreia-Vioglio,a F. Maccheroni,a M. Marinacci,a T. Sargentb a b

A Framework for the Analysis of Self-Con…rming Policies P. Battigalli,a S. Cerreia-Vioglio,a F. Maccheroni,a M. Marinacci,a T. Sargentb a b

Add to Reading List

Source URL: www.tomsargent.com

Language: English - Date: 2016-03-22 14:20:27
27MONFISPOL FP7 project SSHDeliverableUser manual for optimal policy package

MONFISPOL FP7 project SSHDeliverableUser manual for optimal policy package

Add to Reading List

Source URL: www.monfispol.eu

Language: English - Date: 2011-11-16 06:18:26
28MEASUREMENT OF HEARING THREHOLD USING AN AUDIOMETER Theoretical introduction: A precise assessment of hearing function is a frequently performed procedure in otorinolaryngology. It is part of the entrance medical evaluat

MEASUREMENT OF HEARING THREHOLD USING AN AUDIOMETER Theoretical introduction: A precise assessment of hearing function is a frequently performed procedure in otorinolaryngology. It is part of the entrance medical evaluat

Add to Reading List

Source URL: ubi.lf1.cuni.cz

Language: English - Date: 2014-10-05 07:37:18
29The effect of orlistat 120 mg decreased hepatic, renal, or cardiac function, and of concomitant disease or other liver disease, narrow-angle glaucoma, a history of depression, loss of balance or coordination, feeling lig

The effect of orlistat 120 mg decreased hepatic, renal, or cardiac function, and of concomitant disease or other liver disease, narrow-angle glaucoma, a history of depression, loss of balance or coordination, feeling lig

Add to Reading List

Source URL: steam5.com

Language: English - Date: 2016-08-21 10:23:26
30Bias in Natural Actor-Critic Algorithms  Philip S. Thomas  Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Bias in Natural Actor-Critic Algorithms Philip S. Thomas Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2012-10-01 18:27:53